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'notion  of  a  solution  space  was  developed.  There  was  a  time,  not  too  long  ago, 
when  researchers,  especially  those  in  artificial  intelligence,  thought  that 
effective  problem  solving  might  be  due  mainly  to  general  strategies  for  guid¬ 
ing  search  through  problem  spaces.  Early  attempts  to  create  computer  programs 
to  solve  problems,  such  as  the  General  Problem  Solver,  took  this  approach. 

However,  this  straightforward  picture  of  problem  solving  has  recently  been 
shown  to  be  far  too  simple.  Specific  knowledge  of  a  domain  is  of  overriding 
importance  in  the  effective  solution  of  problems.  In  addition,  this  knowledge 
must  be  well-structured  so  that  relevant  knowledge  can  be  accessed  at  the  pro¬ 
per  time.  Research  is  beginning  to  uncover  just  what  "well-structured"  means, 
but  considerable  work  is  left  to  be  done  on  the  problem  of  how  we  are  able  to 
retrieve  and  use  information  in  a  rapid  and  effective  manner./^ 

Even  more  work  needs  to  be  done  on  the  kinds  of  problems  that  are  probably  the 
most  important  to  us  —  ill-defined,  real-world  problems.  Ill-structured 
problems  are  those  which  have  one  or  more  aspects  of  the  problem  unspecified. 
Designing  a  house  would  be  an  example.  Very  little  is  yet  understood  about  the 
nature  of  the  representation  and  solution  processes  of  ill-defined  problems. 
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Solving  problems  is  a  complex  cognitive  skill  that  characterizes 
one  of  the  most  intelligent  human  activities.  From  childhood  onward, 
we  actively  solve  problems  presented  to  us  by  the  world.  We  acquire 
information  about  that  world  and  organize  this  information  into 
structures  of  knowledge  about  objects,  events,  people,  and  ourselves 
that  are  stored  in  our  memories.  These  structures  of  knowledge 
comprise  bodies  of  understanding,  "mental  models,"  convictions  and 
beliefs  that  influence  how  we  relate  our  experiences  together,  and  how 
we  solve  the  problems  that  confront  us  in  everyday  life,  in  school,  in 
our  jobs,  and  at  play. 

How  do  humans  develop  their  abilities  to  solve  problems  in  these 
situations?  People  differ,  children  from  adults,  and  experts  from 
novices,  and  these  differences  are  based  upon  cognitive  processes  and 
mental  organizations  that  humans  have  in  common  that  characterize 
their  problem-solving  abilities.  In  this  chapter  we  emphasize  the 
latter  —  the  general  characteristics  of  human  problem-solving 
ability.  Systematic  theory  on  the  mechanisms  of  human  problem  solving 
is  a  relatively  recent  advance  in  psychological  science,  and  knowledge 
of  fundamental  processes  provides  a  basis  for  understanding  the 
development  and  acquisition  of  our  abilities  to  think  and  solve 
problems. 
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Scientists  interested  in  problem  solving  have  studied  various 
classes  of  problems,  each  having  its  own  task  characteristics,  which 
determine  to  a  large  extent  the  behavior  of  the  problem  solver  and  the 
strategies  appropriate  to  solution.  In  addition,  the  experience  and 
knowledge  brought  to  the  situation  by  the  problem  solver  determines 
whether  and  how  a  solution  will  be  reached.  Puzzle  problems,  like 
tic-tac-toe  or  combining  the  links  of  several  small  chains  into  a^ 
larger  chain  with  the  minimal  number  of  moves,  require  little 
knowledge  of  a  subject-matter  domain.  Solving  problems  in  elementary 
physics  requires  more  subject-matter  knowledge,  such  as  knowledge  of 
force  diagrams  and  of  certain  laws  of  physics.  Puzzle  problems  have 
been  studied  to  a  large  extent  because  they  are  uncomplicated  by  much 
necessary  background  knowledge  and  reveal  the  strategies  that  people 
employ  in  searching  for  a  solution.  Detailed  observations  of 
performance  in  puzzle  situations  have  been  accompanied  by  computer 
simulations  of  these  performances  that  precisely  describe  certain 
general  strategies  or  heuristics  people  use  when  they  are  confronted 
with  novel  situations.  As  a  result  of  extensive  work  with  these 
computer  models  of  problem  solving,  the  main  mechanisms  for  solving 
these  well-structured  puzzle  problems  are  fairly  well  understood.  The 
strategies  used  depend  upon  attention  to  perceptual  cues,  the  goals 
and  subgoals  held  in  memory,  and  the  discovery  of  sequential  patterns 
of  correct  moves. 

Research  has  also  been  carried  out  on  the  nature  of  expert 
problem  solving  in  knowledge-rich  domains  like  chess  playing  and 
school  problems  in  physics  and  mathematics.  Investigation  of  the 
performance  of  experts  and  novices  in  domains  requiring  extensive 
knowledge  has  deepened  our  understanding  of  the  kinds  of  knowledge 


required  for  efficient  problem-solving  ability.  In  particular,  the 
investigation  of  problem  solving  in  domains  requiring  extensive 
knowledge  has  shown  how  the  knowledge  organizations  acquired  by  the 
problem  solver,  that  are  stored  in  long-term  memory,  influence  the 
perceptual  processes  and  strategies  of  problem  solving. 

Two  important  factors,  then,  that  influence  problem  solving,  are 
the  nature  of  the  task  (the  task  environment)  and  the  kind  of 
knowledge  brought  to  the  problem  by  the  solver.  These  two  aspects 
dictate  the  organization  of  this  chapter.  In  the  first  main  section, 
we  consider  puzzle  problems  and  general  processes  of  solution,  and  in 
the  second,  we  discuss  problem  solving  involving  domain  knowledge.  We 
also  consider  various  task  environments  that  involve  insight, 
creativity,  and  ill-structured  problems.  Our  goal,  in  the 
problem-solving  task  of  writing  this  chapter,  is  to  give  the  reader  an 
understanding  of  some  of  the  mechanisms  that  underlie  the  problem 
solving  abilities  of  their  species.  We  begin  by  defining  the  various 
types  of  problems  that  have  been  studied. 

What  Is  a  Problem? 

A  problem  is  a  situation  in  which  you  are  trying  to  reach  some 
goal,  and  you  must  find  a  means  for  getting  there.  Figuring  out 
puzzles,  solving  algebra  problems,  deciding  how  to  budget  a  limited 
amount  of  money,  trying  to  control  Inflation  and  reduce  unemployment 
—  these  are  all  examples  of  problems  that  we  as  individuals  and  as  a 
society  encounter  all  of  the  time.  Clearly,  these  problems  cover  an 
enormous  range  of  difficulty  and  complexity,  but  they  do  have  some 
things  in  common.  They  all  have  some  initial  state ,  whether  it  is  a 


Page  4 


set  of  equations  or  the  state  of  the  economy,  and  they  all  have  some 
goal .  To  solve  the  problems,  it  is  necessary  to  perform  some 
operations  on  the  initial  state  to  achieve  that  goal.  Often,  there 
are  some  rules  that  specify  allowable  operations,  and  these  are 
generally  called  constraints . 


Puzzles  are  one  form  of  problem  with  which  we  are  all  familiar. 
The  Tower  of  Hanoi  puzzle  (shown  in  Fig.  1)  is  a  typical  example  and 
has  received  a  great  deal  of  attention  from  psychologists.  It 
consists  of  three  pegs  and  a  set  of  disks  of  different  sizes.  In  the 
initial  state,  the  disks  are  stacked  up  on  the  first  peg  (the  top 
display  in  Fig.  1)  in  order  of  decreasing  size  (like  a  pyramid),  and 
the  goal  is  to  move  all  the  disks  onto  the  third  peg,  maintaining  the 
pyramid  (the  bottom  display  in  Fig.  1).  The  constraints  in  this 
problem  situation  are  that  the  solver  can  never  place  a  larger  sized 
disk  on  top  of  a  smaller  one  and  that  only  one  disk  may  be  moved  at  a 
time. 


Although  puzzles  are  interesting,  the  kinds  of  problems  that  we 
frequently  encounter  fall  into  two  types:  classroom  problems  and 
real-world  problems.  An  algebra  problem  where  we  have  to  find  the 
unknown  quantity  typifies  our  standard  notion  of  a  problem.  Students 
spend  a  good  deal  of  time  learning  to  solve  various  kinds  of  classroom 
problems  in  such  areas  as  mathematics,  physics  and  chemistry.  A 
distinct  difference  between  puzzles  and  classroom  problems  is  that  a 
fair  amount  of  knowledge  of  a  specific  subject  area  is  necessary  for 
the  solution  of  the  classroom  problems.  Whereas  even  a  young  child 
can  move  a  disk  from  one  peg  to  another,  solving  an  algebra  problem 


■ :  i 


manipulating  equations.  Often  the  most  important  and  difficult 
problems  we  have  to  solve  are  those  we  encounter  in  everyday  life 
outside  classroom.  Such  problems  range  from  finding  our  way  about  in 
a  city,  to  elevating  ourselves  out  of  a  period  of  depression.  In  both 
of  these  cases,  we  have  a  clear  initial  state  (being  lost  at  location 
A  or  feeling  depressed),  and  a  clear  goal  (wanting  to  be  at  location 
B,  or  feeling  elated).  The  solution  for  the  first  problem  might  be 
either  calling  a  taxi,  asking  directions,  or  reading  a  map. 

Puzzles,  classroom,  and  the  real-world  problems  described  above 
are  all  well-defined.  This  means  that  you  can  readily  recognize  when 
the  problem  is  solved.  We  know  when  the  Tower  of  Hanoi  problem  is 
solved  when  the  disks  have  all  been  transferred  to  the  third  peg.  An 
algebra  problem  Is  solved  when  a  quantity  for  the  unknown  is  found 
(although  it  could  be  the  wrong  answer).  Likewise,  we  know  when  we 
feel  less  depressed,  and  we  know  that  we  have  accomplished  our  goal  of 
finding  our  way  when  we  are  at  location  B. 

There  is  another  class  of  problems,  called  ill-defined.  It 
refers  generally  to  problems  where  one  or  several  aspects  of  the 
situation  is  not  well  specified.  Examples  are  composing  a  poem,  or 
designing  a  house.  The  general  nature  of  these  problems  is  that  the 
description  of  the  problems  is  not  clear,  the  Information  needed  to 
solve  them  is  not  entirely  contained  in  the  problem  statement,  and 
consequently,  it  is  even  less  obvious  what  actions  to  take  in  order  to 
alleviate  the  problem. 
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Puzzle  Problems  and  Processes  of  Solution 

Well-structured  puzzle  problems  have  been  traditionally  studied 
by  psychologists  and  have  received  particular  attention  in  the  last 
two  decades.  Their  popularity  arose  from  two  related  considerations. 
First,  with  the  advent  of  computer  science  and  its  accompanying 
notions  and  techniques  of  computer  simulation  of  human  behavior 
(Newell,  Shaw,  &  Simon,  1958),  psychologists  were  particularly 
interested  in  rigorous  investigation  of  the  basic  processes  of  solving 
problems.  The  technique  used  was  to  have  a  subject  "think  out  loud" 
while  solving  a  given  problem.  This  problem  solving  protocol  is  tape 
recorded,  transcribed,  and  then  intensely  analyzed  to  determine  just 
how  the  subject  tried  to  solve  the  problem.  The  problem  solving 
strategies  used  by  the  subject  is  then  simulated  in  a  computer 
program,  to  see  if  the  program  can  produce  similar  solution  patterns. 
Second,  puzzle  problems  require  very  little  background  knowledge  and 
yet  they  can  be  very  difficult  to  solve.  In  such  tasks,  differences 
in  individual's  abilities  to  solve  problems  can  be  attributed  to 
differences  in  some  basic  underlying  problem  solving  processes  rather 
than  to  more  extensive  subject  matter  knowledge.  Thus,  puzzle 
problems  lend  themselves  particularly  well  to  uncovering  the 
underlying  solution  processes,  which  generally  refer  to  the  trace  of 
the  sequence  of  operations  applied  to  transform  the  initial  state  to 
the  goal  state. 

Contrast  the  focus  of  modern  cognitive  psychologists  on  processes 
with  the  earlier  problem  solving  research  by  Gestalt  psychologists  of 
the  1930s.  The  early  work  dealt  mostly  with  "insight"  problems,  and 
emphasized  how  appropriate  changes  in  the  representation  of  a  problem 


could  lead  to  solution.  A  typical  insight  problem  is  the  Two-String 
problem  (Maier,  1931).  Two  strings  are  hung  from  the  ceiling.  The 
object  is  to  tie  them  together,  but  they  are  too  far  apart  for  a 
person  to  reach  both  of  them  at  the  same  time.  On  the  table  nearby 
are  a  book  of  matches,  a  few  pieces  of  cotton,  and  a  screwdriver.  In 
solving  this  problem,  the  insight  comes  in  recognizing  that  the 
screwdriver  can  be  used  differently  than  in  its  usual  function.  In 
this  case,  it  may  be  tied  to  one  string,  to  create  a  pendulum  that  can 
be  swung  to  the  other  string.  This  early  research  on  problem  solving 
focused  on  the  conditions  that  impeded  or  facilitated  problem  solving 
and  how  intuitively  improbable  responses  could  become  more  probable. 
For  example,  would  embedding  the  screwdriver  among  a  different  set  of 
items  on  the  table  make  it  easier  for  the  solver  to  recognize  (and 
break  the  functional  set)  that  a  screwdriver  can  be  used  in  an 
unfamiliar  way,  as  a  weight. 

The  information  processing  approach,  represented  by  the  work  of 
Newell  and  Simon  (1972),  significantly  changed  problem  solving 
research  by  turning  attention  away  from  the  conditions  under  which 
solutions  can  be  reached  to  the  component  cognitive  processes  involved 
in  the  performance  that  transform  the  initial  state  of  the  problem  to 
the  final  goal  state.  In  order  to  make  this  problem  a  manageable  one 
to  study,  many  researchers  in  the  late  sixties  and  early  seventies 
concentrated  on  a  class  of  puzzle-like  problems  called  MOVE  or 
TRANSFORMATION  problems.  These  problems  all  have  clear  initial  and 
goal  states,  and  a  small  set  of  well  defined  operations  (moves)  that 
can  successively  transform  the  initial  state  to  che  goal  state. 


The  Tower  of  Hanoi  is  one  such  problem  which  has  been  extensively 
studied.  A  spatial  metaphor  is  useful  in  analyzing  this  kind  of 
problem.  The  states  of  the  problem  are  represented  as  points  in  a 
space.  The  possible  operations  on  each  state  are  represented  by  lines 
leading  to  the  states  that  these  operations  produce.  Because  there  is 
only  one  initial  state  and  there  is  typically  more  than  one  operation 
which  can  be  applied  to  each  state  to  generate  a  number  of  different 
transformations,  this  solution  space  resembles  an  upside  down  tree. 
Part  of  the  solution  space  for  the  Tower  of  Hanoi  is  shown  in  Figure 
1. 


One  way  to  look  at  the  process  of  solving  a  problem  is  that  it  is 

a  search  through  the  solution  space.  The  space  contains  many  possible 

paths,  but  only  one  (or  a  few)  leads  to  the  point  that  represents  the 

goal  state.  The  distance  travelled  down  a  particular  branch  of  the 

tree,  the  levels  in  Figure  1,  is  often  referred  to  as  the  depth  (D)  of 

search,  and  the  number  of  alternatives  at  a  point  is  the  breadth  (B). 

D 

The  total  number  of  possible  paths  is  equal  to  B  (if  B  is  the  same 

number  at  every  state).  As  you  can  imagine,  the  number  of  alternative 
paths  to  be  considered  explodes  exponentially. 

Representation 

The  representation  of  a  problem  consists  essentially  of  the 
solver's  interpretation  or  understanding  of  the  problem.  Researchers 
have  found  that  Che  representation  is  very  Important  in  determining 
how  easy  a  problem  is  to  solve.  In  the  two  string  problem,  for 
Instance,  insight  is  really  rerepresentation.  The  initial 


representation  of  the  screwdriver  must  be  broadened  to  include  the 
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face  chat  it  is  a  heavy  object,  which  can  therefore  be  used  as  a 
pendulum  weight. 

The  representation  of  MOVE  problems,  like  the  Tower  of  Hanoi,  is 
fairly  straightforward  because  the  initial  and  final  states  and  the 
|  permissible  moves  are  clearly  spelled  out.  However,  it  is  possible  to 

conceive  of  situations  where  the  representation  fails  to  embody  one  or 
several  aspects  of  the  problem.  For  example,  if  the  constraints  on 
|  the  operations  are  not  properly  encoded,  then  the  solution  space  could 

be  enlarged  unnecessarily.  Take  the  case  in  which  the  Tower  of  Hanoi 
problem  is  solved  by  children.  Children  may  have  trouble  remembering 
the  constraint  that  a  larger  disk  may  not  be  placed  on  top  of  a 
smaller  disk,  or  they  may  forget  what  the  goal  state  should  look  like 
(Klahr  &  Robinson,  1982).  If  the  children  forget  a  constraint,  they 
effectively  add  branches  to  the  solution  space,  making  the  correct 
path  harder  to  find.  The  resulting  search  space  is  referred  to  as  the 
solver's  problem  space .  The  dotted  path  from  State  A  to  State  B  in 
Figure  1  shows  one  possible  illegal  addition  to  the  solution  space. 

I 

In  certain  problems,  besides  the  omission  of  constraints,  one 
could  also  add  unnecessary  constraints,  thereby  taking  away  branches 
so  that  the  proper  solution  path  is  not  even  present  i ..  the  problem 
space.  The  problem  thus  creates  an  impossible  one.  Adding  or  taking 
away  branches  due  to  an  improper  representation  is  a  common 
occurrence.  One  example  of  adding  a  constraint  to  the  representation 
is  to  be  seen  in  the  problem  of  drawing  a  continuous  line  through  nine 
dots  arranged  as  three  rows  of  three  without  any  retracing  (see 
Fig.  2a).  Almost  invariably,  solvers  unconsciously  add  the  constraint 
that  the  line  cannot  go  outside  of  the  square  indicated  by  the  dots. 
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as  shown  In  Fig.  2b. 

Besides  adding  or  forgetting  constraints,  the  representation  of  a 
problem  can  also  be  faulty  if  solvers  encode  the  goal  state 
improperly.  This  poses  an  impossible  problem  because  then  solvers 
will  not  know  what  to  search  for.  An  illustration  of  a  problem  in 
which  the  goal  state  is  improperly  encoded  is  the  well-known  sixteen 
matches  problem  (see  Fig.  3a).  The  goal  is  to  form  four  squares, 
given  the  constraint  that  only  three  matches  may  be  moved.  One  of  the 
common  causes  of  error  is  that  the  goal  is  often  represented  as  shown 
in  Fig.  3b,  instead  of  arrangements  like  those  shown  in  Fig.  3c,  where 
each  square  uses  up  four  sticks.  That  is,  the  initial  representation 
of  the  goal  does  not  include  the  information  that  sixteen  sticks  can 
make  up  four  squares  perfectly,  which  means  that  no  stick  should  be 
shared  by  two  squares.  (Another  way  to  look  at  this  is  that  the 
representation  did  not  contain  the  constraint  that  no  stick  should  be 
shared  by  two  squares.) 

The  difficulty  in  solving  MOVE  problems  is  not  generally  related 
to  misrepresentations  of  the  initial  state  but  rather  omitting 
constraints  or  not  having  a  clear  representation  of  the  goal  state,  as 
described  above.  However,  other  puzzle  problems,  such  as  the 
two-string  problem,  are  difficult  for  precisely  this  reason.  In  the 
case  of  more  difficult  problems,  the  initial  representation  is  even 
more  important.  We  will  have  more  to  say  about  this  issue  later. 


Searching  For  &  Solution 

The  process  of  finding  a  solution  to  a  problem  can  be  visualized 
as  a  search  through  the  paths  in  the  problem  space  until  one  that 
leads  to  the  goal  is  found.  Since  MOVE  problems  are  generally  not 
difficult  to  represent,  research  on  solving  MOVE  problems  tended  to 
concentrate  on  uncovering  the  strategies  that  effective  problem 
solvers  use  to  find  a  solution  in  the  problem  space.  There  are  a 
variety  of  strategies  for  carrying  out  this  search. 

One  strategy  is  to  try  paths  randomly,  hoping  to  stumble  upon  the 
goal.  Random  search  is  adequate  if  the  search  space  is  small. 
However,  since  the  search  space  expands  exponentially  for  most 
problems,  the  chance  of  random  search  being  successful  is  quite  small. 

Another  obvious  strategy  is  systematically  searching  the  entire 
tree.  In  a  depth-first  search,  for  instance,  a  particular  path  is 
searched  all  the  way  to  the  bottom.  If  this  state  is  not  the  goal, 
the  search  backs  up  one  level  and  then  starts  down  again  via  an 
untried  path.  When  all  paths  from  a  particular  state  have  been  tried, 
the  search  backs  up  one  more  level  and  starts  down  again,  and  so  on. 
This  method  (and  any  such  exhaustive  method)  requires  quite  a  bit  of 
record  keeping  to  keep  track  of  which  paths  have  been  tried  and  which 
state  should  be  backed  up  to  when  all  links  have  been  tried.  Except 
for  very  simple  problems,  the  memory  required  for  this  record  keeping 
is  too  great  for  human  beings.  Exhaustive  methods  are  often  not  even 
feasible  for  computers  to  use. 
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The  key  co  the  effective  strategies  actually  used  by  humans  Is  to 
significantly  reduce  the  search  space  by  considering  only  one  or  a 
very  few  branches  at  each  point.  For  instance,  de  Groot  (1966)  found 
that  chess  players  use  a  strategy  that  can  be  called  depth-first 
because  they  initially  follow  one  path  straight  down  for  a  few  moves. 
They  make  no  attempt  to  be  exhaustive  or  to  backtrack  systematically, 
however.  Instead,  they  tend  to  jump  back  to  the  beginning  position  or 
to  some  important  intermediate  point  and  gradually  explore  just  a  few 
alternative  branches.  Obviously,  if  only  a  few  alternatives  at  each 
point  are  going  to  be  explored,  they  had  better  be  good  ones,  so  good 
strategies  are  those  that  guide  the  selection  of  promising  moves  or 
the  elimination  of  unpromising  ones. 

Means/ends  analysis .  A  powerful  strategy  for  finding  good 
alternatives,  which  takes  the  goal  state  into  account,  is  means/ends 
analysis.  This  strategy  was  used  as  an  important  search  strategy  in 
one  of  the  earliest  attempts  to  create  a  computer  program  to  solve 
problems,  the  General  Problem  Solver  (Ernst  &  Newell,  1969).  The 
general  idea  is  to  determine  what  differences  exist  between  the 
current  state  and  the  goal  state  and  then  to  find  operations  that  will 
reduce  them.  If  there  is  more  than  one  such  operation,  the  one  that 
reduces  the  largest  difference  is  applied  first.  In  other  words,  find 
the  best  means  to  achieve  the  desired  end. 

In  the  Tower  of  Hanoi,  the  differences  are  simply  that  disks  are 
not  on  the  proper  peg  in  the  proper  order.  At  the  initial  state,  the 
move  that  does  the  most  to  reduce  the  difference  is  placing  the  small 
disk  on  the  third  peg  (see  state  C,  Fig.  1).  Note,  however,  that  this 
move  actually  introduces  a  new  difference,  because  the  disk  is  in  the 
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wrong  (bottom)  position  on  the  right  (third)  peg.  Thus,  being  in  the 
wrong  position  must  be  defined  to  be  a  smaller  difference  than  not 
being  on  the  right  peg.  That  is,  state  C  is  the  correct  move  because 
there  is  less  of  a  difference  between  state  C  and  the  goal  state,  than 
between  state  A  and  the  goal. 

Means/ends  analysis,  however,  is  not  guaranteed  to  find  the 
solution.  At  state  D  in  Figure  1,  for  example,  the  correct  move  is  to 
take  path  y,  even  though  it  does  not  reduce  the  overall  difference. 
General  methods  like  this  one  which  reduce  the  number  of  moves  that 
must  be  considered  but  which  are  not  guaranteed  to  succeed  in  all 
cases,  are  called  heuristics . 

Not  only  can  means/ends  analysis  be  used  to  work  from  the  initial 
state  to  the  goal,  but  it  also  can  be  used  to  work  backwards  from  the 
goal.  In  this  case,  the  strategy  is  to  find  an  operation  (such  as 
path  m  in  Figure  1)  which  can  produce  the  goal  state  when  it  is 
applied  to  a  state  which  is  closer  to  the  initial  state  (i.e.,  state 
F).  In  the  Tower  of  Hanoi  example,  to  find  state  F,  we  can  transform 
state  E  via  path  z,  working  forward.  To  work  backwards  from  the  goal, 
we  want  to  find  the  transformation  (path  m  in  this  case),  which,  when 
applied  in  a  forward  manner,  will  take  us  from  state  F  to  the  goal. 
Working  backward?,  however,  two  states  (states  F  and  H)  can  be 
transformed  from  the  goal,  only  one  of  which  (state  F)  overlaps  with 
one  of  the  two  states  (states  F  and  G)  generated  in  a  forward  way, 
from  state  E.  Thus,  by  combining  forward  and  backward  searches,  one 
can  select  the  right  move  almost  by  default.  This  example, 
illustrates  the  benefit  of  working  both  forwards  and  backwards. 
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The  addicional  benefit  of  working  backwards  (which  is 
unfortunately  not  true  in  the  Tower  of  Hanoi  case),  is  that  often  the 
number  of  "backward  branches"  to  be  considered  is  smaller  than  the 
number  when  working  forward.  Another  difference  between  the  forward 
and  backward  strategy  is  that  one  can  try  out  moves  as  soon  as  they 
are  decided  upon  in  the  forward  strategy,  whereas  backward  strategy 
requires  a  chain  of  moves  to  be  selected  first.  They  are  then  applied 
in  the  same  order  as  they  would  have  been  using  the  forward  strategy. 
Working  backwards,  thus,  requires  planning,  and  can  be  considered  to 
be  more  sophisticated  than  forward  strategy. 

Although  the  Tower  of  Hanoi  is  quite  a  simple  puzzle,  means/ends 
analysis  is  useful  in  more  complicated  situations.  Newell  and  Simon 
(1972)  observed  subjects  using  it  in  solving  cryptarithmetic  problems 
(in  which  an  arithmetic  problem  is  presented  with  letters  standing  for 
numbers  and  the  solver  must  determine  which  letter  stands  for  which 
number).  Reed  and  Simon  (1976)  show  how  it  is  used  on  the 
missionaries  and  cannibals  problem  (in  which  one  boat  of  limited 
capacity  must  be  used  to  transport  missionaries  and  cannibals  across  a 
river  without  anyone  being  eaten)  and  Atwood  and  Poison  (1976) 
demonstrated  its  use  in  the  water  jug  problems.  Simon  and  Simon 
(1978)  noted  a  subject  who  worked  backwards  while  solving 
distance/ rate/ time  problems;  and  Larkin,  McDermott,  Simon,  and  Simon 
(1980)  observed  similar  solution  strategies  in  solving  physics 
problems . 
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Means/ends  analysis  does  have  limitations,  of  course.  In 
particular,  it  will  fail  if  there  is  no  operation  that  will  reduce  the 
remaining  differences.  Such  an  occasion  will  arise  when  a  problem 
requires  a  "detour"  to  get  to  the  goal.  A  detour  generally  implies 
that  the  differences  between  the  current  and  goal  states  must  be 
Increased,  at  least  temporarily.  This  is  the  situation  that  was 
mentioned  in  state  D  in  Figure  1.  The  differences  from  the  goal  state 
are  that  the  large  and  medium  disks  are  on  the  wrong  pegs  and  the 
small  one  is  in  the  wrong  position  even  though  it  is  on  the  right  peg. 
None  of  the  available  moves  actually  reduces  the  overall  difference. 
Moving  the  smallest  disk  (taking  path  y),  which  is  the  correct  move, 
actually  produces  a  larger  difference  than  the  previous  state. 

Subgoal lng.  A  very  useful  strategy,  which  can  be  used  in 
conjunction  with  means/end  analysis,  is  subgoallng.  Subgoaling  is 
simply  picking  out  an  Intermediate  state  on  the  solution  path  to  reach 
as  the  temporary  goal.  In  effect,  subgoallng  divides  a  problem  into 
two  or  more  subproblems,  thus  transforming  the  entire  search  space 
into  two  or  more  spaces  of  smaller  depth. 

A  good  choice  of  subgoals  can  allow  means/ends  analysis  to  be 
used  in  situations  where  alone  it  might  fail  to  meet  the  criterion  of 
reducing  a  difference,  as  when  a  detour  is  needed.  In  the  Tower  of 
Hanoi,  one  useful  subgoal  is  to  get  the  largest  disk  onto  its  proper 
peg,  because  it  must  be  put  there  first  before  the  others  can  be 
placed  on  top  of  it.  Look  at  state  D  in  Figure  1  again,  with  that 
subgoal  in  mind.  Now  the  differences  to  consider  are  that  the  large 
one  is  on  the  wrong  peg  and  the  small  one  is  blocking  it.  By  using 
means/ ends  analysis  now,  the  correct  move  (taking  path  y)  does  reduce 
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the  difference  while  the  less  efficient  ones  do  not.  Thus,  subgoaling 
can  remove  the  apparent  need  to  move  away  from  a  goal  in  order  to  get 
to  it.  In  effect,  when  subgoaling  is  used  with  means/ends  analysis, 
it  removes  the  limitation  of  means/ends  analysis. 

Subgoaling  also  serves  to  reduce  the  search  space  significantly. 
Remember  that  the  size  of  the  space  increases  exponentially  with  its 
depth.  If  the  subgoal  we  have  selected  is  on  the  solution  path,  then, 
once  we  have  reached  it,  the  number  of  alternative  paths  to  be 
considered  to  reach  the  goal  is  reduced  significantly,  as  Figure  4 
shows.  The  obvious  advantage  to  problem  solving  is  that  a  larger 
problem  is  now  divided  into  smaller  ones. 

A  pragmatic  difficulty,  though.  Is  how  to  choose  useful  subgoals. 
There  are  several  heuristics  for  finding  them.  One  way  is  to  work, 
backward  first,  and  then  use  the  new  state  that  one  has  reached  as  the 
subgoal  to  work  forward  to.  Another  method  is  to  "decompose"  the  main 
goal.  In  the  Tower  of  Hanoi  problem,  the  goal  is  to  have  the  set  of 
disks  on  the  third  peg  in  a  certain  order,  and  an  obvious  subgoal  is 
to  first  put  the  larger  disk  on  the  third  peg.  In  order  to  achieve 
this  subgoal,  the  third  peg  must  be  empty  and  the  large  disk  must  be 
uncovered.  In  other  words,  the  other  disks  must  be  on  the  middle  peg. 
Because  large  disks  can  never  be  on  top  of  small  ones,  a  two-disk 
subgoal  Is  generated  with  the  middle  peg  as  the  goal  peg.  This  form 
of  the  subgoal  is  very  useful  when  there  are  more  than  three  disks  in 
the  problem,  because  it  decomposes  the  puzzle  into  a  hierarchical 
series  of  smaller  problems. 


Another  heuristic  for  setting  subgoals  is  to  consider  only  one 
constraint  at  a  time.  This  serves  systematically  to  narrow  down  the 
problem  space.  Actually,  this  may  be  the  way  most  people  solve  some 
everyday  problems,  since  they  may  not  initially  think  of  all  the 
constraints  involved.  Suppose,  for  example,  that  you  are  buying  your 
first  house.  Your  Initial  constraint  is  that  the  price  be  under 
$50,000.  After  looking  around  at  a  few  houses,  you  realize  that  in 
order  to  cut  down  on  the  heating  bi  s,  you  need  a  well-insulated 
contemporary  house  rather  than  one  of  those  old  mansions.  Thus,  your 
next  search  will  be  limited  only  to  smaller  contemporary  houses,  but 
still  under  the  initial  price  constraint  of  $50,000. 

Although  theoretically,  subgoaling  seems  to  be  an  ideal  strategy, 
research  on  the  effect  of  providing  a  subgoal  does  not  find  a  clear 
facilitating  effect.  Providing  a  subgoal  for  problem  solvers  can 
create  more  confusion  because  they  seem  not  to  know  how  to  proceed 
after  the  subgoal  has  been  reached.  Hayes  (1966),  for  example,  found 
that  people  took  longer  to  solve  problems  when  a  subgoal  was  provided, 
even  though  the  subgoal  helped  them  reach  the  situation  quite  easily. 
It  may  be  the  case  that  unless  problem  solvers  define  their  own 
subgoals,  providing  one  will  not  be  as  effective. 

Generate-and-test .  A  heuristic  that  is  useful  l.i  many  cases,  is 
to  generate  a  set  of  possible  solutions  to  a  given  problem  directly, 
and  then  to  test  each  one  to  see  if  it  is  the  correct  solution. 
Consider  the  task  of  opening  a  small  combination  lock.  Possible 
solutions  are  all  short  series  of  numbers  (usually  three)  that  fall 
between  the  smallest  and  largest  numbers  on  the  dial.  These  are  easy 
to  generate.  Testing  is  also  easy  —  just  turn  the  dial  and  see  if 


the  lock  opens.  Fortunately  (or  unfortunately,  depending  on  your 
situation),  the  total  number  of  possible  solutions  is  quite  large,  and 
the  possibility  of  success  in  this  case  is  quite  small. 

Generate-and-Test  is  useful  only  when  it  is  reasonably  easy  to 
generate  the  set  of  potential  solutions  and  test  them;  the  set  must 
be  fairly  small;  and  the  path  from  the  current  state  to  the  solution 
should  be  unimportant  for  the  problem  solver,  or  the  search  space  is 
difficult  to  generate. 

Particularly  relevant  examples  of  the  generate  and  test  strategy 
are  scientific  research  and  medical  diagnosis.  In  research,  an 
invesclgator  will  generate  a  hypothesis  to  explain  some  observed 
phenomena  (the  behavior  of  humans  solving  problems,  for  instance)  and 
then  devise  experiments  to  test  it.  In  medicine,  physicians  typically 
make  a  tentative  diagnosis  (or  hypothesis)  based  upon  a  partial 
description  of  the  initial  state  (the  patient's  symptoms)  and  then 
order  various  examinations  and  laboratory  tests  to  confirm  or 
disconfirm  the  diagnosis.  In  both  cases,  even  if  the  initial 
hypothesis  is  disconf irmed ,  the  testing  process  generally  adds  new 
information  about  the  initial  state  and  can  lead  to  new  hypotheses. 
These  examples  lend  themselves  particularly  well  to  a 
generate-and-test  heuristic,  mainly  because  it  is  obviously  difficult 
to  find  a  set  of  operations  that  can  transform  the  initial  state  into 
the  current  state.  In  the  medical  diagnosis  example,  the  initial 
state  consists  mainly  of  symptoms,  and  the  goal  state  is  to  explain 
what  causes  them.  Typically,  this  initial  state  is  incompletely 
known;  some  of  the  symptoms  may  be  irrelevant;  and  there  may  be 
multiple  causes  at  work  (a  patient  may  be  suffering  from  heart,  lung 
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and  kidney  problems,  for  Instance).  In  such  a  case,  it  is  much  easier 
to  hypothesize  a  particular  cause  and  determine  what  the  effects  would 
be  by  comparing  them  with  the  known  symptoms. 


In  sum,  Che  outcome  of  the  research  on  puzzle-type  problems  has 
been  very  useful  in  the  uncovering  of  several  standard  and  powerful 
general-purpose  problem  solving  techniques:  means/ends  analysis, 
subgoaling,  and  generate-and-test .  These  heuristics  have  been 
observed  to  be  used  across  a  wide  range  of  problems,  from  puzzles  to 
physics  problem  solving  to  medical  diagnosis. 


Domain  Problems  and  Knowledge 


Although  research  on  puzzle  problem,  particularly  MOVE  problems, 
flourished  well  into  the  seventies  and  provided  valuable  Insights  into 
the  kinds  of  general  problem  solving  strategies  people  use,  it  became 
apparent  that  in  more  complicated  kinds  of  problems,  strategies  alone 
did  not  sufficiently  describe  problem  solving  performance.  Knowledge 
of  the  problem  domain  is  also  important  and  can  Influence  the  use  of 
general  problem  solving  heuristics.  Striking  demonstrations  of  the 
influence  of  knowledge  on  problem  solving  processes  were  provided  by 
researchers  in  both  cognitive  psychology  and  artificial  Intelligence. 
In  cognitive  psychology,  de  Groot  (1966)  and  then  Chase  and  Simon 
(1973-a,b)  explored  what  makes  master  chess  players  different  from 
less  expert  ones.  De  Groot  obtain  protocols  from  former  world 
champions  and  from  some  fairly  skillful  club  players  as  they  tried  to 
find  the  best  move  in  a  given  situation.  Surprisingly,  he  found  no 
strategy  differences.  All  players  tended  to  consider  the  same  number 
of  moves.  They  also  looked  ahead  about  the  same  number  of  moves  as 
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they  tried  to  evaluate  each  move,  and  they  used  the  same  strategy  to 
guide  this  search.  In  contrast  to  other  players,  experts  simply 
recognized  the  best  move  and  gave  it  first  consideration,  evaluating 
the  other  moves  only  as  a  way  of  double  checking  themselves.  This 
ability  to  perceive  the  problem  in  a  way  that  restricts  the  problem 
3pace  has  since  been  shown  to  occur  in  other  areas  as  well.  For 
instance,  in  medical  diagnosis  research,  both  experts  and  novices  are 
found  to  use  the  same  kind  of  generate-and-test  heuristic.  The 
experts,  however,  start  with  a  more  accurate  hypothesis  (Elsteln, 
Shulman,  &  Sprafka,  1978).  Thus,  what  differentiates  an  expert 
problem  solver  from  non-expert  is  not  the  use  of  different  or  more 
powerful  heuristics,  but  performance  that  allows  the  expert  to  succeed 
in  pursuing  the  better  path  to  solution  without  considering  all  the 
others. 

Research  in  artificial  intelligence  aimed  at  creating  chess 

playing  programs  (e.g.,  Wilkins,  1980)  has  further  demonstrated  that 

human  skill  is  not  based  on  strategies  to  guide  search  through  the 

solution  space.  The  number  of  possible  moves  at  a  given  position  can 

be  fairly  large,  perhaps  30  or  35,  and  because  the  size  of  a  search 

space  increases  exponentially  with  each  subsequent  move  on  a  given 

possible  path  of  play,  the  size  of  the  chess  space  is  very  large 
,  120 

(somewhere  around  10  paths).  Even  the  most  powerful  computers  are 
not  able  to  directly  search  the  whole  space  this  large,  and  it  must  be 
drastically  reduced  by  using  sophisticated  computational  strategies  to 
evaluate  and  eliminate  moves.  Although  these  programs  can  play  at 
tournament  level,  they  succeed  because  they  can  do  far  more  search 
than  humans,  using  complex  statistical  strategies  to  compute  and 
evaluate  the  best  move  in  a  way  that  humans  do  not.  Even  with  these 


advantages,  they  still  cannot  match  the  best  players.  Clearly,  good 
chess  players  do  something  very  different  from  searching  through  the 
space  for  a  good  move. 

The  evidence  indicates  that  what  humans  actually  do  is  to  build 
up  an  extremely  large  store  of  knowledge  about  typical  chess  positions 
through  years  of  experience.  De  Groot  (1966)  provided  a  clue  when  he 
found  that  chess  masters,  when  shown  a  chess  position  for  a  very  brief 
duration  (five  seconds),  were  able  to  remember  it  with  very  high 
accuracy,  whereas  the  less  expert  players  could  not.  This  difference 
could  not  be  attributed  to  the  experts'  superior  visual  memory  because 
when  random  board  positions  were  used  that  were  not  like  chess 
patterns,  masters  did  as  poorly  as  novices  (Chase  &  Simon,  1973-a). 
Using  very  simple  techniques.  Chase  and  Simon  showed  that  the  masters 
were  actually  perceiving  groups  of  pieces,  or  chunks .  They  asked 
subjects  to  reproduce  from  memory  given  board  positions  just  shown  to 
them,  or  to  copy  a  position  from  one  board  to  another.  During  the 
reproduction  task,  groupings  could  be  detected  if  significantly  longer 
pauses  occurred  after  the  placement  of  several  individual  pieces  in 
quick  succession.  Because  the  contents  of  a  chunk  are  closely 
associated  in  memory,  once  a  chunk  is  accessed,  recall  of  its 
component  pieces  will  be  rapid.  However,  more  time  is  needed  to 
access  another  chunk.  When  copying  a  position,  clear  evidence  of 
grouping  can  be  detected  by  the  periodic  glances  at  the  board.  The 
subject  would  to  turn  and  look  at  the  given  board,  place  a  series  of 
pieces,  then  look  back  again,  place  the  next  series  of  pieces,  and  so 
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Significant  pauses  and  head  curns  as  evidence  of  chunking  were 
observed  with  both  expert  and  novice  subjects.  However,  expert 
groupings  contained  three  to  six  pieces,  whereas  novice  groupings 
contained  only  one  or  two.  It  is  the  larger  sizes  of  the  experts' 
chunks  that  enabled  them  to  remember  so  many  more  pieces  from  a 
chessboard  than  novices  can.  Moreover,  the  pieces  in  the  expert 
patterns  were  related  in  identifiable  ways  to  chess  knowledge.  They 
constituted  very  common  patterns,  such  as  a  castled-king  or  a  pawn 
chain,  or  local  clusters  of  pieces.  Additional  support  for  the 
existence  of  chunks  comes  from  eye  movement  studies.  When  experts 
were  asked  to  analyze  a  board  position,  their  eye  fixations  showed 
that  they  were  concentrating  on  groups  of  pieces,  such  as  those  in 
important  attack  and  defense  relationships  (Simon  &  Barenfeld,  1969). 

Differences  in  chunking  between  experts  and  non-experts  have  been 
observed  in  other  domains,  as  well.  In  electronics,  Egan  and  Schwartz 
(1979)  found  that  skilled  technicians  reconstructing  symbolic  drawings 
of  circuit  diagrams  did  so  according  to  the  functional  nature  of  the 
elements  In  the  circuit  such  as  amplifiers,  rectifiers,  and  filters. 
Novice  technicians,  however,  produced  chunks  based  more  upon  the 
spatial  proximity  of  the  elements.  When  Akin  (1980)  asked  architects 
to  reconstruct  building  plans  from  memory,  several  levels  of  patterns 
were  produced.  First,  local  patterns  consisting  of  wall  segments  and 
doors  are  recalled,  then  rooms  and  other  areas,  then  clusters  of  rooms 
or  areas.  In  other  words,  they  exhibited  a  hierarchical  pattern  of 
chunks  within  chunks. 
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The  important  implication  of  this  research  is  that  when  experts 
look  at  an  apparently  complicated  situation,  they  are  able  to 
represent  it  in  terms  of  a  small  number  of  patterns  or  chunks*  If  the 
situation  is  very  intricate  as  with  the  architectural  drawings,  their 
knowledge  is  further  organized  into  embedded  sets  or  hierarchical 
structures. 

How  Structured  Knowledge  Facilitates  Problem  Solving 

Although  understanding  the  cognitive  processes  involved  in 
finding  a  good  chess  move  is  clearly  a  difficult  and  challenging 
problen  for  analysis,  we  are  especially  interested  in  what  these 
findings  tell  us  about  solving  the  kinds  of  problems  we  encounter  in 
the  classroom  and  in  life  in  general.  Remember  that  the  initial 
representation  of  a  problem  is  very  important  in  determining  how  easy 
the  problem  is  to  solve.  Just  as  a  chess  player's  knowledge  allows  a 
representation  of  a  given  situation  to  be  formed,  a  problem  solver's 
knowledge  somehow  determines  the  problem  representation.  Then  the 
proper  problem  solving  procedures,  if  they  are  known,  must  be 
retrieved  from  memory  and  applied.  It  is  the  problem  solver's 
representation  that  guides  retrieval  of  appropriate  solution 
procedures. 

In  order  to  consider  this  representation  in  more  detail,  it  is 
very  helpful  to  use  the  concept  of  a  schema.  A  schema  is  a 
theoretical  construct  which  describes  the  format  of  an  organized  body 
of  knowledge  in  memory.  A  schema  is  conceived  of  as  a  modifiable 
Information  structure  that  represents  generic  concepts  stored  in 
memory.  Schemata  represent  knowledge  that  we  experience  — 
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Interrelationships  between  objects,  situations,  events,  and  sequences 
of  events  that  normally  occur.  In  this  sense,  schemata  contain 
prototypical  information  of  frequently  experienced  situations,  and 
they  are  used  to  interpret  new  situations  and  observations  (Rumelhart, 
1981).  In  many  situations,  much  information  is  left  out  so  that  one 
could  never  understand  a  situation  without  filling  it  in  by  means  of 
prior  knowledge.  Estes  (National  Academy  of  Sciences,  1981)  explains 
this  point  by  describing  the  following  vignette:  "At  the  security 
gate,  the  airline  passenger  presented  his  briefcase.  It  contained 
metallic  objects.  His  departure  was  delayed."  In  order  to  understand 
this  commonplace  incident,  an  individual  must  have  a  good  deal  of 
prior  knowledge  of  air  terminals.  This  kind  of  prior  knowledge  is 
represented  in  memory  by  a  schema  that  specifies  the  relationship 
between  the  roles  played  by  various  people  in  the  terminal,  the 
objects  typically  encountered,  and  the  actions  that  typically  ensue. 
Schema  theory  assumes  that  there  are  memory  structures  (schemata)  in 
memory  for  recurrent  situations  that  are  experienced,  and  a  major 
function  of  schemata  is  the  construction  of  an  interpretation  of  a  new 
situation. 

The  objects  of  a  schema  may  be  thought  of  as  variables  or  slots 
into  which  incoming  information  can  fit.  If  enough  slots  of  a 
particular  schema  are  filled,  it  becomes  active.  An  active  schema  can 
then  guide  and  seek  information  to  fill  remaining  slots.  If  such 
additional  information  is  not  available  in  the  environment,  then  the 
schema  will  fill  its  slots  with  information  typical  of  a  particular 
situation,  activate  procedures,  and  make  any  other  knowledge  it 
contains  easily  accessible.  In  effect,  the  schema  is  a  prototypical 
structure  that  can  incorporate  observed  phenomena.  People  often  react 


very  rapidly  and  effectively  to  incoming  stimuli.  Recognizing  a 
familiar  face  (for  example)  seems  to  happen  almost  instantaneously  and 
the  speed  is  independent  of  the  number  of  faces  one  knows.  In  this 
case,  it  can  be  said  that  the  features  of  a  face  fit  into  slots  in 
"face  schemata,"  and  one  of  them  becomes  active  and  provides  the 
person's  name. 

Schemata  are  closely  related  to  chunks.  When  a  chess  player 
looks  at  the  board,  the  individual  pieces  can  fit  into  the  slots  of 
"chess  pattern  schemata."  These  schemata  then  provide  the  names  or 
symbols  that  represent  patterns  in  memory  and  later  enable  the  player 
to  retrieve  the  individual  pieces  when  they  are  needed.  Further,  a 
set  of  active  pattern  schemata  can  activate  some  higher  level 
"position  schema,"  which  provides  the  move.  With  the  concept  of 
schema  in  mind,  we  can  look  in  more  detail  at  how  knowledge  and  its 
organization  affect  problem  solving. 

It  is  now  easier  to  think  about  how  problem  representations  are 
formed.  Essentially,  they  are  determined  by  the  existing  schemata  and 
the  slots  they  contain.  If  a  problem  is  of  a  very  familiar  type,  it 
can  trigger  an  appropriate  problem  schema,  if  not,  some  more  general 
schema  will  be  triggered.  In  any  event,  the  slots  in  the  schema 
control  what  features  of  the  problem  are  incorporated  into  the 
representation;  features  that  do  not  fit  into  a  slot  will  be  ignored. 

Once  a  schema  is  triggered,  it  can  control  the  solution  if  it 
contains  the  necessary  information.  If  it  is  a  specific  and 
appropriate  schema,  it  might  contain  precisely  the  right  procedures. 
In  this  case,  the  solver  can  proceed  easily  and  rapidly.  If  it  is  a 
general  schema,  it  might  only  contain  a  general  prescription  for  how 
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to  proceed.  In  this  case  the  solver  will  have  to  search  for 
procedures  which  fit  the  given  situation  and  the  general  prescription. 
The  solution  will  then  be  much  more  difficult  and  may  be  impossible  if 
the  proper  procedures  can  not  be  found.  If  an  inappropriate  schema  is 
somehow  triggered,  the  solver  will  not  make  any  progress  at  all. 
Thus,  the  importance  of  the  knowledge  structure,  how  it  is  organized 
into  schemata,  becomes  clear.  It  is  the  organization  and  structure 
provided  by  schemata  which  allow  relevant  knowledge  to  be  found  in 
memory,  and  although  lack  of  knowledge  may  sometimes  be  a  reason  for 
failure  on  a  problem,  lack  of  access  to  that  knowledge  because  of 
Inadequate  structure  may  also  be  the  cause. 

There  is  some  experimental  support  for  this  interpretation.  For 
Instance,  Hinsley,  Hayes,  and  Simon  (1978)  asked  high  school  and 
college  students  to  classify  76  algebra  problems  in  any  way  they 
wished.  There  was  considerable  agreement  among  the  students  on  the 
types  of  categories.  Problems  were  grouped  that  would  be  solved  in 
the  same  way,  such  as,  triangle  problems,  ratio  problems,  and 
river-current  problems.  Furthermore,  students  could  classify  these 
problems  after  hearing  only  the  first  sentence.  For  example,  a 
problem  starting  with  "John  walked  three  miles  east  and  then  four 
miles  north..."  could  readily  be  classified  as  a  triangle  problem; 
that  is,  one  using  the  Pythagorian  theorem  for  solution.  The  rapidity 
with  which  problems  are  classified  appears  to  rule  out  the  possibility 
that  problem  categories  are  identified  after  the  students  have 
formulated  a  solution.  Instead,  it  is  suggested  that  the  problems 
were  indeed  triggering  some  appropriate  schemata  in  memory. 
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The  existence  of  specific  schemata  can  also  be  uncovered  by 
showing  how  they  can  be  mistakenly  accessed.  For  example,  poorer 
students,  in  particular,  will  classify  problems  erroneously  if  they 
contain  "cover  stories"  that  trigger  a  specific  schema  (Silver,  1981). 
A  problem  used  by  Hinsley,  Hayes,  and  Simon  (1978)  might  be  mistakenly 
identified  as  a  triangle  problem  even  though  it  is  actually  a 
distance-rate-time  problem.  This  is  because  irrelevant  information 
concerning  the  triangular  relation  among  the  three  problem  elements 
have  been  introduced.  Good  students  were  not  fooled,  however. 

A  similar  contrast  was  found  by  Chi,  Feltovich  and  Glaser  (1981). 
Students  who  had  completed  introductory  physics  with  an  A  grade  and 
physics  instructors  group  typical  physics  problems.  The  students 
tended  to  group  problems  which  contained  similar  physical  entities, 
(as  shown  in  Fig.  5),  while  the  instructors  grouped  problems  according 
to  the  underlying  principle  (as  shown  in  Fig.  6).  Thus,  from  these 
studies,  it  is  apparent  that  good  and  experienced  problem  solvers  are 
not  unduly  influenced  by  the  superficial  features  of  a  problem 
statement. 

The  results  of  these  categorization  studies,  which  show  that  good 
and  experienced  problem  solvers  are  not  deceived  by  cover  stories. 
Indicate  that  their  skill  arises  from  the  complexity  and  completeness 
of  their  schemata.  Their  schemata  must  contain  rules  which  are  more 
complex  than  simple  linking  of  superficial  (or  "primary")  cues  in  the 
problem  statement  with  solution  procedures.  The  schemata  of  the 
novice  learners  may  be  developed  Initially  with  these  simple  rules. 
But,  in  order  for  novices  to  learn  not  to  be  misled  by  cover  stories, 
they  must  also  develop  "secondary"  knowledge  (Chi,  Feltovich,  & 
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Glaser,  1981);  chat  Is,  knowledge  which  incorporates  the  interaction 
among  the  primary  cues  in  the  problem  statement  and  which  implies  a 
generally  applicable  rule.  Hence,  a  complete  schema  must  contain  not 
only  che  procedures  (such  as  the  equation  to  compute  the  hypothenuse 
of  a  triangle),  but  also  the  primary  and  secondary  cues  to  identify 
the  problem  type  appropriately. 

Another  way  to  illustrate  these  separate  components  of  schemata 
is  to  use  problem  isomorphs.  These  are  secs  of  problems  whose 
underlying  structures  and  solutions  are  all  the  same,  but  whose 
context  can  be  quite  different.  An  example  of  a  problem  isomorph  is 
the  Tea  Ceremony  (Hayes  &  Simon,  1974).  In  it,  three  people  are 
conducting  an  oriental  tea  ceremony  and  they  must  distribute 
responsibility  for  various  parts  of  the  ceremony  among  themselves 
according  to  certain  rules.  In  actuality,  this  puzzle  is  the  familiar 
Tower  of  Hanoi  with  the  pegs  replaced  by  people  and  the  disks  by  parts 
of  the  ceremony.  Anyone  who  is  familiar  with  the  Tower  problem  and 
who  notices  the  correspondence  can  solve  the  Tea  Ceremony  quite 
easily.  However,  studies  show  that  virtually  no  one  notices  the 
similarity.  One  way  to  interpret  this  finding  is  that  slots  of  the 
Tower  of  Hanoi  schema  were  designed  specifically  for  pegs  and  disks. 
Thus,  when  people  and  ceremonies  were  presented  instead,  the  Tower  of 
Hanoi  solution  schema  was  not  accessed,  even  though  the  subjects 
possessed  very  efficient  procedures  for  solving  the  Tea  Ceremony 
puzzle. 
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In  sum,  studies  on  the  solution  of  problems  where  a  great  deal  of 
domain  knowledge  is  involved  indicate  clearly  that  a  very  relevant 
part  of  success  in  problem  solving  is  the  access  of  a  large  body  of 
well-structured  domain  knowledge.  Hence,  one  important  direction  of 
current  research  is  to  explore  how  a  large  body  of  knowledge  is 
organized  and  represented,  so  that  it  can  be  easily  accessed  for 
successful  solving  of  problems. 


Ill-structured  Problems 


Perhaps  the  best  way  to  define  an  ill-defined  problem  is  by 
default.  That  is,  problems  which  do  not  fall  into  the  class  of 
well-defined  problems,  as  those  that  have  been  studied  and  described 
above,  may  be  considered  an  ill-defined  problem.  One  framework  that 
can  be  used  to  conceptualize  ill-defined  problem  is  in  terms  of  the 
information  processing  approach  mentioned  earlier  in  the  context  of 
solving  puzzle  problems.  In  that  framework,  a  problem  has  a  clear 
initial  state,  a  set  of  permissible  operators,  and  a  goal  state.  A 
problem  qualifies  to  be  an  ill-defined  one  if  any  one  or  all  of  the 
three  components  are  not  well-specified.  For  example,  the  initial 
state  may  be  vague.  Consider  economic  problems,  for  Instance.  Our 
economy  is  so  complex  that  we  really  do  not  understand  it  very  well. 
For  any  given  economic  problem  we  have  only  a  partial  picture  of  the 
initial  state  based  on  all  of  the  various  statistics  the  government 
and  other  agencies  collect.  Not  only  do  we  have  Incomplete  data  to 
describe  the  initial  state,  but  even  professional  economists  cannot 
agree  on  the  interpretation  of  the  statistics  that  we  do  have.  A 
problem  can  also  be  considered  ill-defined  if  the  operators  are  not 
well  specified.  In  our  economic  problem,  the  various  actions  that 
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might  be  taken  to  modify  the  initial  state  are  not  clear  and  many 
possible  actions  have  not  even  been  thought  of  yet.  And  finally,  the 
problem  is  ill-defined  if  the  goal  state  is  not  clear.  What 
conditions  of  controlled  inflation  and  unemployment  are  to  be 
attained?  How  much  is  too  much?  As  you  know,  experts  disagree  on  the 
answer  to  these  questions.  In  fact,  one  prominent  characteristic  of 
ill-defined  problem  is  that  there  is  generally  a  lack  of  consensus 
even  amount  the  experts  with  respect  to  what  the  appropriate  solution 
is. 

There  has  been  very  little  research  done  on  ill-structured 
problems.  One  interesting  piece  of  work,  recently  conducted  by  Voss 
and  his  associates  (Voss,  Greene,  Post  &  Penner,  in  press;  Voss, 
Tyler  4  Yengo,  in  press)  used  problems  from  the  social  sciences,  much 
like  the  economic  problem  we  have  just  described.  Imagine  you  are  the 
Minister  of  Agriculture  for  the  Soviet  Union.  Crop  productivity  has 
been  too  low  for  the  past  several  years.  What  would  you  do  to 
Increase  crop  production?  This  ill-structured  problem  has  all  the 
components  unspecified.  The  initial  state  is  far  more  complicated 
than  just  bad  crops.  Among  other  things,  the  Soviet  political  system, 
the  current  agricultural  system,  the  amount  of  arable  land  and  the 
weather  are  all  part  of  it.  The  task  is  to  find  some  actions  or 
operations  that  may  be  taken  to  Improve  the  situation,  but  there  is  no 
mention  at  all  of  what  they  might  be.  And,  finally,  the  goal  is 
vaguely  stated.  How  much  of  an  Increase  is  an  appropriate  goal?  5Z? 


20Z?  200Z? 
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In  order  to  try  to  disentangle  the  various  effects  of  knowledge, 
Voss  and  colleagues  used  subjects  who  were  political  scientists 
specializing  in  the  Soviet  Union,  students  who  were  taking  a  political 
science  course  on  Soviet  domestic  policy,  and  chemistry  professors. 
Dot  surprisingly,  they  found  strong  knowledge  effects.  The  most 
comprehensive  and  detailed  solutions  were  produced  by  the  Soviet 
experts,  and  the  worst  were  the  students  and  the  chemists.  About  24% 
of  the  solution  protocols  of  the  Soviet  experts  were  involved  with 
elaborating  the  initial  state  of  the  problem,  as  opposed  to  1%  for  the 
students  and  the  chemists. 

Prominent  in  the  experts'  protocols  at  the  initial  phase  of 
problem  solving  were  the  identification  of  possible  constraints,  such 
as  the  Soviet  Ideology  and  the  amount  of  arable  land.  Defining 
constraints  will  provide  a  means  of  testing  possible  solutions,  such 
as,  fostering  private  competition  (a  capitalistic  solution)  or 
increasing  planting  which  can  be  rejected  inmediately  under  these 
constraints. 

The  obvious  way  to  solve  a  problem  of  this  sort  is  to  eliminate 
its  causes,  thus,  the  importance  of  enumerating  them  in  the  problem 
representation.  The  realization  that  there  are  a  series  of  separate 
causes  leads  naturally  to  the  decomposition  of  the  problem  into 
subproblems,  that  is,  subgoaling.  All  of  the  subjects  from  expert  to 
novice  used  this  strategy.  The  differences,  however,  were  that  the 
experts  tended  to  create  a  few  very  general  subproblems  which  might 
encompass  several  related  causes,  while  the  novices  related  solutions 
very  directly  to  individual  causes.  For  instance,  one  expert 
identified  the  Soviet  bureaucracy,  the  attitudes  of  the  peasants 


toward  agricultural  practices,  and  the  lack  of  infrastructure 
(railroads,  fertilizer  plants,  fuel  distribution  networks,  etc.)  as 
the  main  subproblems.  In  the  most  extreme  examples  of  this  tendency, 
the  problem  was  recast  into  a  single  "subproblem,"  such  as,  inadequate 
technological  development. 

There  are  at  least  two  conclusions  we  can  draw  from  this 
research.  The  first  is  that  even  in  ill-defined  problems,  solvers  use 
heuristics  not  unlike  those  that  they  use  in  well-defined  problems, 
such  as  subgoallng.  This  suggests  that  knowledge  of  the  problem 
domain  really  makes  it  easier  for  one  to  define  the  problem,  such  as 
identifying  the  constraints.  For  example,  Reitman  (1965)  found  from 
his  protocols  of  a  composer  writing  a  fugue,  that  at  any  moment  in 
time,  the  composer  was  really  dealing  with  a  very  well-defined 
subproblem,  even  though  the  subproblem  required  some  initial  work  to 
define  it. 

Because  ill-defined  problems  require  this  special  component  — 
the  process  of  adding  information  to  the  problem  situation  —  one 
sometimes  refers  to  the  solution  of  these  problems  as  a  creative  act 
(Newell,  Shaw,  &  Simon,  1964).  Another  name  for  a  creative  act  is 
insight.  The  term,  insight,  often  connotes  the  meaning  that  the 
solution  was  achieved  in  one  single  step,  rather  than  a  series  of 
discrete  transformations,  as  in  the  case  of  well-defined  problems. 
The  two-string  problem  we  mentioned  earlier  seems  to  satisfy  our 
intuition  of  insight.  Contemporary  research  is  just  beginning  to 
explore  the  process  of  insight  and  it  appears  that  insight  itself  can 
be  decomposed  into  several  component  processes,  such  as  encoding  the 
information  in  a  selective  way,  combining  information  in  a  novel  way 


or  comparing  certain  aspects  of  two  objects  (Sternberg  &  Davidson, 
1983).  For  example,  Fleming's  discovery  of  penicillin  required  him  to 
notice  that  the  bacteria  in  the  vicinity  of  a  moldy  dish  containing  a 


culture  was  destroyed.  Therefore,  one  could  say  that  Fleming  was 
simply  more  insightful  or  creative  than  the  other  researchers. 
However,  encoding  information  selectively  requires  that  one  knows 
which  piece  of  information  is  relevant.  That  is,  Fleming  must  have 
some  stored  knowledge  that  only  certain  potent  substances  can  destroy 
the  bacteria.  Hence,  two  tentative  conclusions  can  be  gleaned  from 
the  current  preliminary  research.  First,  creative  and  insightful  acts 
may  not  necessarily  be  discrete  and  unitary.  They  may  actually  be 
composed  of  a  sequence  of  subprocesses.  Second,  the  execution  of 
these  subprocesses  in  an  apparently  creative  way  may  require  some 
existing  stored  knowledge. 


Over  the  years,  psychologists  have  learned  a  lot  about  the  nature 
of  the  problem  solving  process.  The  importance  of  the  initial 
representation  of  a  problem  was  discovered  quite  a  few  years  ago  by 
the  Gestalt  school  in  its  examination  of  insight  problems.  The  actual 
nature  of  initial  representations  and  their  influence  on  problem 
solving  was  made  clear  only  in  the  last  couple  of  decades,  however, 
when  the  notion  of  a  solution  space  was  developed.  There  was  a  time, 
not  too  long  ago,  when  researchers,  especially  those  in  artificial 
intelligence,  thought  that  effective  problem  solving  might  be  due 
mainly  to  general  strategies  for  guiding  search  through  problem 
spaces.  Early  attempts  to  create  computer  programs  to  solve  problems, 
such  as  the  General  Problem  Solver,  cook  this  approach. 
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As  we  have  seen,  though,  this  fairly  simple  picture  of  problem 
solving  has  recently  been  shown  to  be  far  too  simple.  Specific 
knowledge  of  a  domain  is  of  overriding  importance  in  the  effective 
solution  of  problems.  In  addition,  this  knowledge  must  be  well 
structured  so  that  relevant  knowledge  can  be  accessed  at  the  proper 
time.  Research  is  beginning  to  uncover  just  what  "well  structured" 
means,  but  considerable  work  is  left  to  be  done  on  the  problem  of  how 
we  are  able  to  retrieve  information  in  a  rapid  and  effective  manner 
from  the  wealth  of  knowledge  we  all  possess. 

Even  more  work  needs  to  be  done  on  the  kinds  of  problems  that  are 
probably  the  most  important  to  us  —  ill-structured,  real-world 
problems.  The  process  of  solving  such  problems  is  very  difficult  and 
complex  and,  thus,  very  difficult  to  study,  especially  because  it  is 
often  not  clear  whether  an  ill-structure  problem  has  been  solved.  In 
addition,  the  moat  Important  real-world  problems  are  often  solved  (or 
not)  through  complex  interactions  among  groups  of  people,  the  nature 
which  are  little  understood. 


Fig.  1.  The  Tower  of  Hanoi:  puzzle  and  its  solution  states. 


Fig.  2 
Fig.  3 
Fig.  4 
Fig.  5 

Fig.  6 


.  The  nine  dots  problem  (a) ,  typical  solution  attempts  (b) ,  and  the 
solution  (c). 

.  The  sixteen  matches  problem  (a),  two  common  faulty  representations 
of  the  goal  (b) ,  and  two  correct  representations  of  the  goal  (c). 

.  The  shaded  area  shows  how  much  of  the  search  space  is  reduced  when 
a  subgoal  on  the  correct  path  is  provided. 

.  Examples  from  novices'  problem  categories.  Problem  numbers  represent 
chapter  and  problem  number  from  Halliday  and  Resnick  (1974).  (From 
Cognitive  Science,  1981,  5^(2). 

.  Example  from  experts'  problem  categories.  Problem  numbers  represent 
chapter  and  problem  number  from  Halliday  and  Resnick  (1974).  (From 
Cognitive  Science,  1981,  5^(2). 


TABLE  1 


The  Smalltown  problem 

Because  of  their  quiet  ways,  the  inhabitants  of  Smalltown  were  especially 
upset  by  the  terrible  New  Years  Eve  auto  accident  which  claimed  the  life  of 
one  Smalltown  resident.  The  facts  were  these.  Both  Smith  and  Jones  were  New 
Years  Eve  babies  and  each  had  planned  a  suprise  visit  to  the  other  one  on 
their  mutual  birthday.  Jones  had  started  out  for  Smith's  house  traveling  due 
east  on  Route  210  just  two  minutes  after  Smith  had  left  for  Jones'  house. 

Smith  was  traveling  directly  south  on  Route  140.  Jones  was  traveling  30  miles 
per  hour  faster  than  Smith  even  though  their  houses  were  only  five 'miles  apart 
as  the  crow  flies.  Their  cars  crashed  at  the  right  angle  intersection  of  the 
two  highways.  Officer  Franklin,  who  observed  the  crash,  determined  that  Jones 
was  traveling  half  again  as  fast  as  Smith  at  the  time  of  the  crash.  The  crash 
occurred  nearer  to  the  house  of  the  dead  man  that  to  the  house  of  the  survivor. 


What  was  the  name  of  the  dead  man? 


c).  Solution  of  the  Nine  Dots  Problem 
Figure  2 


/ 


Diagrams  Depicted  from  Problems  Categorized 
by  So  vices  within  the  Same  Groups 


Novices '  Explanations  for  Their  Similarity 
Groupings 


Problem 


Problem 


Problem 


Problem 


10  (11) 


Novice  2: 


Novice  3: 
Novice  6: 


11  (39) 


Novice  1: 
Novice  5: 
Novice  6: 


7  (35) 


"Angular  velocity,  momentum, 
circular  things" 

"Rotational  kinematics,  angular 
speeds,  angular  velocities" 

"Problems  that  have  something 
rotating;  angdar  speed" 


‘These  deal  with  blocks  on  an 
incline  plane  ” 

“Inclined  plane  problems, 
coefficient  of  friction  ” 

"Blocks  on  inclined  planes 
with  angles" 


Figure  5.  Examples  from  novices'  problem  categories.  Problem  numbers  represent  chapter 
and  problem  number  from  Halliday  and  Resnick  (1974). 


Diagrams  Depicted  from  Problems  Catergorized 
by  Experts  within  the  Same  Groups 


Experts '  Explanations  for  Their  Similarity 
Groupings 


Problem  6  (21) 
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equilibrium 


Problem  7  (35) 


Expert  2:  " Conservation  of  Energy" 

Expert  3:  “Work-Energy  Theorem. 

They  ere  all  straight-forward 
problems." 

Expert  4:  "These  can  be  done  from  energy 
considerations.  Either  you  should 
know  the  Principle  of  Consenarion 
of  Energy .  or  work  i$  lost 
somewhere." 


Problem  5  (39) 


Expert  2:  'These  can  be  solved  by  Sew  ton  s 
Second  Law" 

Expert  3:  "F  •  ma;  Newton  S  Second  Law  ” 

Expert  4:  "Largely  use  F  -  ma;  Newton 's 
Second  Law" 


Problem  12  (23) 


Figure  6.  Example  from  experts'  problem  categories.  Problem  numbers  represent  chapter 
and  problem  number  from  Halliday  and  Resnick. 
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